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ROBUST  OPTIMUM  INVARIANT  TESTS  OF  COVARIANCE 
STRUCTURES  USEFUL  IN  LINEAR  MODELS 


1 .  Introduction . 

In  this  paper  we)  investigates/ robust  optimum  invariant 

tests  of  some  covariance  structures  that  naturally  arise  in 

the  context  of  robustness  study  in  linear  models.  The 

concept  of  robustness  in  connection  with  linear  models  is 

entirely  different  from  the  notion  prevalent  in  multivariate 

analysis  (vide  Kariya  and  Sinha  (1985))  and  refers  to  the 

2 

structures  of  X  and  V  in  the  model  (Y,  XB,  a  V)  in  contrast 

to  the  distribution  of  Y.  Here  X  is  known  as  the  design 
2 

matrix  and  a  V  the  variance-covariance  matrix  of  Y. 

■  (s  ,,  ■■  ^ 

v  To  describe  this  concept,  let  (Y,  XB,  q*T)  be  the 
assumed  (probably  incorrect)  model  while  (Y,  XB ,  o^V)  be  the 
correct  model,  resulting  in  the  specification  error  in  the 
dispersion  matrix.  Then  it  is  well  known  that  the  BLUEs  of 

.  I  '  *• 

all  estimable  linear  parametric  function  AB  remain  the  same 
under  both  the  models  if  and  only  if  the  following  condition 
holds  on  the  structure  of  V: 

i  1 

<;  X'VZ  =  0">  (1.1 

where  Z  denotes  a  matrix  of  maximal  rank  satisfying  the 

f 

condition  Z'X  =  0.  This  result,  in  various  equivalent  forms 
appears  in  Rao  (1967),  Zyskind  (1967),  Rao  and  Mitra  (1971), 


Mathew  and  Bhim^sankaram  (1983)  ,  and  also  in  Sinha  and 

Drygas  (1983) .  Our  object  is  to  test  the  null  hypothesis 

/'  V  y 

that  V  possesses  the  structure  s a ti s f y irrrg— HrrT)  based  on 

samples  on  Y  under  the  model  (Y,  X 6,  o?V)  for  a  fixed  design 

_fv--  •%) 

matrix  X.  This  hypothesis  is  of  considerable  interest  as 
its  acceptance  greatly  simplifies  determination  of  BLUEs  of 
estimable  linear  parametric  functions. 

Below  we  work  with  a  canonical  form  of  this  problem 
which  is  now  developed.  Let  Y:  n  x  1,  X:  n  x  k  with  rank  (X) 
=  r  <  k,  so  that  X  =  X°C  for  some  X°:  n  x  r  of  rank  r 
and  for  some  C:  r  x  k  of  rank  r  .  Consequently  the  matrix 
Z:  n  x  (n-r)  which  satisfies  Z'X  =  0  also  satisfies  Z#X°  =  0. 
It  is  then  clear  that  the  condition  (1.1)  is  equivalent  to 


X°  VZ  =  0 


(1.2) 


Defining  Y(i)  =  Z'Y  and  Y(2)  =  xgY  and  making  the  1:1 
transformation  Y  ■*  (Yjij,  '^(2))^  it  then  follows  that  the 
condition  (1.2)  is  equivalent  to  testine  the  hynothesjp 
that  Y(i)  and  Y(2)  are  uncorrelated.  If  Y  is  assumed  to  be 
normally  distributed,  this  is  the  familiar  problem  of  test 
of  independence  of  two  random  vectors  ^(l)  and  Y(2)  with  the 
added  restriction  that  EY { l )  =  0  since  Z'X°  =  0.  This 

problem  is  analyzed  via  invariance  in  the  next  section  where 
normality  of  the  underlying  data  matrix  is  replaced  by  an 
elliptically  symmetric  distribution. 


There  is  another  form  of  robustness  in  linear  models 


in  connection  with  tests  of  estimable  linear  parametric 

functions.  To  describe  this  briefly,  consider  the  problem 

o 

of  testing  Hq :  AB  =  0  under  the  assumed  model,  (Y,  XB  ,  a  I), 
2 

a  unknown,  and  Y  is  distributed  normally.  Here  AS  is 

estimable  and  hence  testable.  It  is  well-known  that  the 

F-test  based  on  the  ratio  of  sums  of  squares  due  to  the 

hypothesis  and  due  to  the  error  is  both  LDT  and  UMPI  under 

a  suitable  group  of  transformations  (vide  Lehmann  (1959)). 

The  answers  to  the  question  "Is  the  F-test  (given  above) 

2 

under  the  model  (Y,  XB ,  a  I)  still  LPT  under  the  correct 

*> 

model  (Y,  XB ,  a^V)?"  have  been  put  forward  by  Khatri  (1980), 
Ghosh  and  Sinha  (1980)  and  Mathew  and  Bhimasankaram  (1983). 
It  turns  out  that  the  answer  is  in  the  affirmative  under 
the  following  condition  on  V; 


(I  -  px  )  V(I  -  Px  )  =  a(I  -  Px  ) 


(1.3) 


for  some  a  >  0  where  P^  =  A(A#A)  A',  (A'A)  is  a  generalized 

inverse  of  A'A,  and  Xq  =  X(I  -  a~A). 

The  second  object  in  this  paper  is  indeed  to  test  the  hy- 

2 

pothesis  that  V  in  the  model  (Y,  XB ,  o  V)  possesses  a  structure 
satisfying  (1.3),  for  a  fixed  linear  parametric  function  A' B , 
based  on  samples  of  Y  for  a  fixed  design  matrix  X.  Inci¬ 
dentally,  if  we  demand  (1.3)  to  hold  for  all  estimable  A* B , 
it  turns  out  that  V  =  I  is  the  only  matrix  satisfying  this 
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Aval  j. '<1  for 
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condition . 
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aa 


As  before,  here  again  we  work  with  a  canonical  form. 

Writing  I  -  PY  =  DD“  for  some  D:  n  x  y  of  rank  y  = 
o 

rank  (I  -  PY  )  and  noting  that  D'D  is  p.d.,  it  follows  easily 
xo 

that  (1.3)  is  eauivalent  to 


D'VD  =  al, 


for  some  a  >  0 


(1.4) 


Defining  now  W  =  D'Y,  it  follows  that  (1.4)  is  eauivalent 
to  testing  the  sphericity  of  W  with  the  added  restriction 
that  E(W(ij)  =  0  where  W  =  (W'^W'j))'.  This  follows  from 
the  fact  that  the  range  or  the  column  space  of  D  contains  a 
subspace  which  is  orthogonal  to  X.  This  problem  is  taken  up 
in  section  3  via  invariance  with  normality  of  Y  replaced  by 
elliptical  symmetry  of  Y. 


2.  Test  of  Independence 


The  canonical  form  of  this  problem  is  as  follows.  Based 

on  a  data  matrix  Y:  n  x  p  [  X  :  Z  ]  obeying  the  model 

nxpi  nxp2 


Y  =  [1  u'  :  0]  +  U J1/2  ,  M  £  RPl  ,  l  p.d, 


where  U  has  an  elliptically  symmetric  distribution  with 
density 


(2.1) 


-V/'/V  ■/’  .V  .V  .v  "■  .V  ,•  ^  ,n  ,v  .v.v.s , 


5 


f (u)  =  q(  tr  u"u ) 


for  some  q:  [0,  ®)  -*■  [0,  ®) 


(2.2) 


such  that 


/  q(  tr  u'u'Jdu  =  1 , 
Rnxp 


we  want  to  test  the  hypothesis  Hq:  £  =  0  vs.  H]_: 

£  t  0  .  Here  \  is  expressed  as 


l  - 


^11 

^21 

PI 


^12 

^22 

P2 


PI 

P2 


The  p.d.f  of  Y  can  be  written 


as 


f(Ylu,[)  =  lll“n/2  •  q(tr£~X(X  -  lu',Z)'(X  -  lyr,Z))  (  2.3) 


When  p  =  0  or  the  mean  of  Z  is  not  known  and  Y  is  normal, 
this  is  the  usual  problem  of  testing  independence  of  two  vec¬ 
tors  for  which  optimum  solutions  do  exist  in  the  literature. 
For  example,  Schwartz  (1967)  established  that  the  test  based 
on  tr  S  S~^S  S-'*'  is  IBI  in  general.  For  nonnormal  Y,  its 
null  and  optimality  robustness  under  certain  conditions  on 


q  are  established  in  Kariya  and  Sinha  (1985).  Of  course, 
when  p^  =  pg  =  1 ,  this  test  boils  down  to  the  ordinary  pro- 
dut  moment  correlation  test  and  becomes  UVPI .  Here  S 
denotes  the  sample  Wishart  matrix  based  on  Y  and  S  is 


•  .  ■  •  •  .  * .  •  ‘  .  *  -  •  .  '  ,  *  V s'\'l  s'  v*\  v’*» 


-r 

J" 


decomposed  as  S  =  T Sxx  Sx2 

J^S2X  szz 
Pi  P2 

We  shall  see  how  the  solution  changes  in  our  problem  because 
of  the  information  that  the  mean  of  Z  is  C.  We  mention 
that  under  the  assumption  of  normality  of  Y,  this  problem 
under  a  slightly  wider  framework  appears  in  Eaton  and 
Kariya  (1983). 

Before  we  discuss  this  problem  from  the  point  of  view 

of  invariance,  let  us  quickly  look  into  the  LRT.  Define 

x:  p  x  1,  z:  p  x  1  and  S  in  the  usual  fashion  and 
1  ~  2 

decompose  S  as  S  =  "sxx  SX2 

szx  szz 

* 

as  mentioned  before.  The  likehood  function  (2.3)  can  be 
written  as 

x-u 

f(U,£|Y)  =  1 1 1  ~n/2q(  n  r£-1{~_"")  ( (x -|i);z’’  )  +  tr£-1S)  (2.4) 

z 

Assuming  that  q(*)  is  a  nonincreasing  function  of  its 

A  A 

argument,  it  follows  that  the  MLE  y  of  y  satisfies  x  -  y 


sup  f(u,£|Y)  =  Ijl  /  q{ n  tr  \22  zz'  +  tr  \  S) 
U  '  ~~ 


Using  (Raof1^3)) 


Ell  El2 


•21  L22 


Ell. 2 _ "^11.2^12^22 

"^22^21^11.2  E 2 2  +  E 22^ 2lE 11. 2^12^22 


where 


•11.2 


Ell  "  E12  E 22  E 21  ' 


'I'  =  I l  'I l 

22  11.2 


we  get 


tr  X'  S  +  n  tr  l 


22  zz'  *  tr  l~ 2  (Slx  +  n  zz')  + 


tr  Eh.2(Sxx"  ll2^22Szx  "  Sxz^22^2l  +  ^12^22Szz^22^21] 


While  maximizing  the  likelihood  function  with  respect  to  £ 
we  consider  the  reparametr ization  £22'  In  2  an<^  ^22^21  = 
(say).  The  expression  in  (2.5)  in  terms  of  these  new 
parameters  can  be  written  as 


sup  f(|i,  £ly)  =  |J  |“n/2  |£  l~n/2 

22  11.2 

U 


<3(tri;i(S2  +n«-)+tri:}  ,(sYX.  +<5-S„ aS’  )Sa rM-Sx zSlV)) 


Clearly  this  attains  its  maximum  with  respect  to  5  when 


C  =  Sxzs  J  resulting  in 


sup  f ( u /  I22,  IX1  2,  Sly)  =  I l22 
Ur( 


-n/2  ,c  | -n/2 
*•11.2 


*(tr  I22  (Szz  +  n“'}  +  tr  G.2Sxx.«) 

Finally,  using  a  result  of  Anderson  and  Fang(i982)  we  know 
that  if  q  is  nonincreasing  and  differentiable  the  MLEs  of  ^ 
anc*  In.  2  are  given  by 


I22  =  Xmax(<5)  '  IS,,  * 


9 


•11.2  ~  Xmax^  ^xx*z 


where  Xmax(q)  is  the  solution  of  the  equation 


<*'«?>  +  T  -J(?>  *  »• 


For  example,  if  q(x)  =  e  2 ,  Xmax(q)  = 


Therefore,  we  have, 


,ly)  •  C,  <q)IS+n;;-rn/2ls„v.J 


‘22  L11 .2  * '  maxKHI  zz  ZZ  1  "xx-z 


-n/2 


( 2.10) 


1  ls«  +  n;;-rn/2  isvv. ,rn/2  qipx'Liqn 


Analogously,  under  the  null  hypothesis  Hq  :  --  0,  we  get. 


sup  f (u,£ !y)=\”^x(q) lSZ2+nzz' 


ISxx!  2c5(PXmax(<5)} 


(  2.11  ) 


In  the  above 


yielding  the  LRT  statistic  as  {IS  I/IS  I }n^2. 

we  have  assumed  that  q  satisfies  q(pX_1  (q))  <  «,  o  <  X  (q)  <  » 

max  max 

Remark  2.1:  It  may  be  noted  that  the  LRT  derived  above  is 
just  the  one  without  the  information  that  Z  has  mean  zero 
and  so  it  ignores  this  information.  When  Y  is  normally 
distributed,  this  is  derived  in  Lee  and  Geisser  (1972).  As 
noted  in  Eaton  and  Kariya  <1983>>  this  is  rather  surprising. 

Our  derivation  of  the  LBI  test  of  Hq :  =  0  versus 

Hi*  ll2  /  0  under  the  model  (2-3>  parallels  a  similar 
derivation  in  Kariya  and  Sinha  (1985)  where  Z,  like  X,  also 
is  assumed  to  have  an  unknown  mean  matrix  of  the  form  16' 
for  some  6  e  RP2.  We  restrict  our  attention  to  the 
likelihood  function  given  in  (2.4)  and  without  any  loss  of 
generality  due  to  the  invariance  of  the  problem  assume  that 
a)  x  -  y  =  0  in  (2.4)  and  b)  £  is  of  the  form 


T  =  (A,  0):  p]_  x  P2  with  A  =  diag(5]_,  $pj_)*  Here 

without  any  loss  of  generality  pi  <  P2  is  assumed.  The 
result  a)  follows  upon  noting  that  the  testing  problem 
remains  invariant  under  the  transformation  x  ■*  x  +  5  for 
5  e  Rpl,  that  the  left  invariant  measure  on  Rpl  is  Lebesgue, 
and  that  the  result  of  integrating  out  £  in  (2.4)  after 
the  substitution  x  -*•  x  +  5  (this  is  while  invoking  Wijsman's 


t  Jl  .<»  ,!>'  ,t| 


r. 


Representation  Theorem)  is  nothing  but  to  put  x  -  y  =  0  in 
(2.4),  multiply  the  right  hand  side  of  (2.4)  by 

2^^'  aPart  from  a  constant,  and  replace  q  by  some 
q:  [0,  ®)  -*•  [0,  ®)  satisfying  a  similar  integrability 
condition  as  q.  The  resultant  expression  for  (2.4)  is 
then  given  by  (writing  q  for  q) 


f(YlE)=ki£„rn/2|£i;L  2l"(n'1)/2q(n  tr^zz'  +  tr^S)  (2.1 


-ith  I  =  fl 


r  ,  where  k  is  a  constant. 


In  this  setup  the  problem  is  to  test  Hq:  A  =  0  versus 
Hi:  A  /  0.  It  is  easy  to  see  that  the  testing  prolem  in 
this  somewhat  reduced  form  remains  invariant  under  the  group 
G  of  transformations 


G  =  A  = 


\  O']  ,  Ai 

0  A2 
•  »  J 


e  Gi(p^),  with  the  group  action 


S  -*•  ASA' 


z  ■*  AjZ 


(  2.1 


A  left  invariant  measure  on  G  is  v(dg)  =  v1(dA1)v2(dA2) 

-(P:+l)/2 

with  v. (dA. )  =  lA.Ari  1  ,  i  =  1,  2, 


l  l 


and  the  inverse  of 
n  n+1 

2  2 

the  jacobian  of  transformation  is  given  by  I A1A£ I  •  |A2A'| 


Using  Wijsman's  Representation  Theorem 
dPA 

r^(y)  =  is  obtained  as 


(1967), 


the  ratio 


^A(y)  * 


-n 

2 

/  |£|  q(ntrz'A'A_z 

G8.(  Pl )  xGi  ( P2  )  " 


J  q(ntrz'A'A_z 

GMPl)xGt(p2)- 


n-P1“l  n-p2 

+  tri^ASA')  |  AxA£  |  2  |A2A2I  2  dAxdA2 


n-P1-l  n-p2 

+  tr J_1ASA- ) I AxAJ |  2  |A2A2I  2  dAxdA2 

(2.14] 


Remark  2.2;  It  is  clear  that  an  optimum  test  of  HQ  versus 
Hi  is  obtained  by  examining  the  behavior  of  the  ratio  r^(y) 
as  a  function  of  A.  In  the  special  case  when  z  is  absent  in 
this  ratio,  it  is  proved  in  Kariya  and  Sinha  (1985)  that  there 
exists  a  UMPI  test  if  p1  =  p2  =  1  and  an  LBI  test  in  general. 
However,  with  the  presense  of  z  in  this  ratio,  there  does 
not  exist  a  UMPI  test  even  when  p,  =  p_  = 


1.  This  is 


observed  in  Eaton  and  Kariya 
normally. 


(1983)  when  Y  is  distributed 


Now  to  derive  an  LBI  test,  we  proceed  as  follows? 
Straightforward  computations  yield 


and  with  V  =  (A:0),  and  I  I A I j  2  =  £  6?  small, 

i  =  l  1 


( I  -  AA')_1  =  ( I  -  A'A)"1  =  I  +  AA'  +  0(  I  I A I  I  2 )  ( 

P1  P1  P1 


(I  -  AA')_1A  =  A  +  o ( I  I  A  I  I  2 )  =  A ( I  -  A'A) 
P1  P1 


I  +  A ( I  -  A'A) _1A'  =  I  +  AA'  +  o (  I  I  A  I  I ) 2 


Also  we  have. 


tr  ASA'  =  tr  A1S11A'  +  tr  A2S22A' 


so  that 


trn  2 'A2A2z  +  tr  ASA'  =  tr  A^S^A^  +  tr  A2 (S22+nzz' )A' 


Finally,  using  (2.15)  and  (2.16),  we  get 


n  tr  z'A'A2  z  +  tr  £-1  ASA' 


=  [tr  A1S11A'+tr  A2(S22+nzz')A' ]  +  tr(£  1-Ip)ASA 


while 


*  tr  {(' 

% 

<3 

-r  ' 

'  +  o*( 1 1 A 1 1 2) ) 

ViiAl 

A1S12A2 

-r 

r'r_ 

A2S21A' 

A2S22A2. 

=  tr  AA'A1S11A'  -  2tr  TA2  S2]A'  +  tr  T'T  A^^  + 
o  (IIAII2)  {tr  ASA') 

where  o*(||A!l2)  is  a  matrix  of  order  p  x  p  all  of  whose 
elements  are  o(||A||2).  We  now  make  the  transformation 

A1S112  A1  '  A2(S22  +  n  K')1/2  A2  (2.22) 

This  reduces  r^{y)  to 


^A(y)  * 


-n 

2 


I l I  J  q( tr  A. A'+trA-A^J+trAA'A. A'-2trrA,WA'  + 

G4  ( px  )  xGS,  {  p2  ) 1  1  11  1  . 

n-p1~l  n-p2 


trr'TA2VA'+o(  II A|  I 2) ) |AXA' I  2  J  | 


dA2dA 


where 


V  =  (S22  +  n  zz')~1/2  S22(S22  +  n  zz')'1/2 

W  =  (S22  +  n  zz'}‘1/2  s2isii/2  ' 

and  the  term  o( I  I  All  2)  is  uniformly  so  in  Y.  This  is  because 
both  V  and  W  satisfy  I |W'W| I  <1,  I IVI I  <1. 

We  now  expand  the  numerator  of  r^(y)  around  A  =  0  using 
standard  Taylor  expansion.  Towards  this  end,  we  assume  that 
q  is  thrice  continuously  differentiable  and 

n-l~P1  n-p2 

J|qU){tr  AjA'  +  tr  A^ljl^A'I  2  I A2A'  I  2  I  tr  PA2QA'|21l  +  1 

( 2.23) 

Itr  PP'AjA'  +  tr  RR'A2A'  1 3  dAj^dA2  <  <»  > 

for  Z  =  0,1,  P:  p:  x  p2,  Q:  p2  x  p1 ,  R:  p2  x  p± 

where  qx(x)  =  -  ,  i  =  1,  2,  3. 

dx1 

Then, 

q[ { tr  A.Ar  +  tr  A-Al)  +  tr  AA'A.A'  -  2tr  V  A.WA'  +  (2.24) 


tr  r*r  a2va2  +  o ( i i a i  r>  j 

=  q( tr  AjAj  +  tr  A2A')  +  S(A:A)q(1) (tr  AjA'  +  tr  A2A') 

+  |  {6(A:A)}2q(2)(tr  A^A'+tr  AjAJ  )  +  |  {5(A:A)}3- 
q(3)[(tr  AjA'  +  tr  A2A')  +  (1-a)  6 ( A: A ) J  ,  0  <  a  <  1 

where 

6 ( A: A )  =  tr  AA'AjA£-2tr  V  A2WA£+tr  T'T  A2VA'  +  o(llA||2) 

To  evaluate  the  integrals  of  these  terms  over  GJl(pi)  x 
Gfc(P2)»  we  note  the  fact  that  the  integrals  of  odd  functions 
of  A^  and  A2  are  zero  because  the  integrals  are  finite  by 
our  assumption  (2.23).  Moreover, 

J  (tr  r'r  A.vA;)q(1) (tr  a, a:  +  tr  a.a;)  <  2.25) 

G4 (Px ) xGi ( p2 )  1  11  ’ 

n-l-p1  n-p2 

1 Aia£ I  2  IA2A'|  2  dA1dA2=c1(q) (trr'D (trV)=c1(q) (trA'A) (trV) 
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where 


)q(1) (trA1A£+trA2A') . 

(2.26) 

n-l-p1  n-p2 

I A1A£ |  2  *  IA2A'|  2  dALdA2 

and 


J  (tr  T  A-WA')2q(2) (tr  A.Af  +  tr  A2A')  .  (2.27) 

G4(p1)xGJl(p2) 

n-l~P1  n-p2 

I AxA£ 1  2  lA2A'|  2dA1dA2=c2(q) (trr'T) (trWW')=c2(q) (trA'A) (trWW') 


Cl(q)  pxp2  G4(pi{xGj[fp^)Pi 


:°]A2A'rip- 


where 


c2(q)=^2  j  <tr  :0)A2fl  Ta')V^  ( trA.  AfftrA-A; ) 

Pl  Gl(Pl)xG!l(p2)Pl  2|  Pl|  1  11  22 


|A1A1 


n-l“P1 
-» ,  2 


(2.28 


•  IA2A2I  dAxdA2 


(2.25)  and  (2.27)  can  be  proved  along  the  same  lines  as 
in  Kariya  (1978),  Eaton  and  Kariya  (1983),  and  Kariya  and 
Sinha  (1985).  The  expressions  for  and  c2  and  Y  is 

normally  distributed  appear  in  Eaton  and  Kariya 
are  given  by  c*  =  -  c*  =  ^n~1}  . 

Additionally,  we  get 


/  (tr  AA'A. A'+trr'TA.VA;) 2q ( 2) (trA. A'+trA-Al) 
GJl  (p^ )  xG2.  ( p2 )  x  1  22  11  1  l 


(  2.29) 


n-l-r 


n-p2 

~2 


1 AiA£ I  2  !A2A2I  dA1dA2  =  o (  I  I  A  I  I  2 )  ; 


J  (tr  AA'A,A.;  +  trr'rA_VA;)  (trTA_WA' )2q(2)  (trA.Ar  +  trA^Al) 
GMp1)xG)l(p2)  x  x  22  21  11  22 


(  2.30) 


n-l-c 


n-p2 

~2 


I AXA£ I  2  IA2A2I  dAxdA2  =  o(  MAI  I2) 


■%  n  pvt 


and  so  on.  These  results  follow  primarily  because  V  and  W 
are  bounded  in  norm  as  mentined  before  and  the  integrals 
involved  are  finie  by  our  assumption  (  2.23). 


We  are  now  ready  to  collect  all  the  different  terms 
arising  out  of  the  integrals  of  the  expression  in  (2.24). 

A  straightforward  computation  shows  that  the  ratio  r^(y)  in 
(  2.22)  is  given  by 

r^(y ) =  1 1 1 ~n/2[ l+( trAA' ) {trV)c1+c2(trWW')}+o( I  I  A  I  I  2 ) ]  (2.3 


=  1+ ( tr  AA'){c1(tr  V)  +  c2(tr  WW' )  +  |  }  +  o ( | I  A  I  I  2  ) 


since 


II  -  AA'|'n/2 
pl 


1  +  |  tr  AA'  +  o (  I  I  A  I  I  2 ) 


A  simple  application  of  the  Neyman-Pearson  Lemma  then 
yields  the  following  result. 

Theorem  2.1:  For  testing  HQ :  £12  =  0  vs  H1 :  J12  f  0  under 
the  model  (2.1)-(2.2),  the  test  which  rejects  HQ  for 
large  values  of  C]_{q)tr  V+  C2(q)  tr  WW^  is  LBI  for  a  given  q 
satisfying  Assumption  (2.23). 


Remark  2.3:  When  V  is  absent,  the  LBI  test  statistic  coincides 


with  the  popular  expression  tr  S22S21S11S12  §iven  bY  Kariya  and 


Sinha  (1985),  and  represents  a  robust  LBI  test  for  all  q  satis¬ 
fying  Assumption  (2.23).  When  V  is  present  but  Y  is  normal, 
this  expression  is  the  same  as  in  Easton  and  Kariya  (1983)  (their 
equation  (4.6)). 


Remark  2.4:  The  LBI  test  statistic  derived  for  a  specific  q 


remains  robust  for  q  e  Q,  a  class  of  densities  satisfying 

c,  (q) 

Assumption  (2.23),  whenever  — — r-  is  a  constant,  independent  of 


q.  It  is  easy  to  verify  that  for  normal  variance  mixtures 
-i-tr  u'u/w 

- - -  -Tr, . dG(w)  ,  c,  (G)  and  c9(G)  are  given  by 

o  (277)np/  wp/^  1  * 


f(u)  = 


*  * 

c^(G)  =  c1 ,  c2(G)  =  c2>  independent  of  G.  Hence  the  LBI  test 


is  optimality  robust  at  least  for  arbitrary  normal  variance  mix¬ 
ture  family.  The  null  robustness  of  the  LBI  test  in  this  case 
follows  easily  from  Kariya  (1981). 


3 .  Testing  Sphericity 

The  canonical  form  of  this  problem  is  identical  with  that  in 

2 

Section  2.  However,  here  we  are  testing  H  •  T  =  a  I  versus 

U  L  p 

2  2 

Hi :  £  f  o  Ip,  a  >0  unknown.  When  the  mean  of  X  is  also  zero  or 
the  mean  of  Z  is  unknown,  this  is  the  well  known  problem  of  test¬ 
ing  sphericity  for  which  optimum  tests  are  derived  in  Sugiura 
(1972)  under  the  assumption  of  normality  of  Y,  and  in  Kariya  and 
Sinha  (1985)  under  a  more  general  distribution  of  Y.  Here,  as  in 


Section  2,  only  one  of  the  means  is  unknown  and  we  shall  see  the 
solution  changes  drastically. 


Before  we  employ  the  principle  of  invariance  in  an 
attempt  to  derive  an  optimum  invariant  test,  here  also 
we  first  derive  the  LRT.  The  likelihood  function  appears  in 
(2.4)  and  its  unconstrained  supremum  is  given  in  (2.10)  of 
Section  2.  Under  the  null  hypothesis  Hq  :  £  -  a  l  ,  a  >  0 
unknown,  (2.5)  reduces  to 

sup  f  ( u,  a2ly)  =  (a2)  np//2q(tr(n  zz'  +  s)/o2)  (3.1) 

U 

Finally,  using  a  version  of  the  same  result  by  Anderson  and 
Fang (1982)  mentioned  in  Section  2,  we  know  that  if  q  is 
nonincreasing  and  differentiable  the  MLE  of  a 2  is  given  by 

a2  =  0 { q )  tr ( n  zz'  +  s) 


where  0(q)  is  the  solution  of  the  equation 


q' 


o 


For. example,  if  q(x) 


Hence 


sup  f  (  y ,  a2  I  Y)  =  {  0  (q) }  ~np/2  { tr  ( n  22  '+s ) }  ”np/2q(  ) 

P/a2 

~H„ 


(3, 


Comparing  (2.10)  and  (3.2),  the  LRT  criterion  X  is 
obtained  as 


XX  *2 


S  +  n  22 ' 
22 


X  a  [  • 


n 

,2 


<tr(Sxx  +  S22  +  n  IV )}1 


(3.: 


Here  also  we  have  assumed  that  X  (q)q(pX  1  ( q ) )  <  ®  and 
_  max  ^  max  ’ 

-np 


9(q)  2  qlg^)  < 


Thus  the  LRT  criterion  remains  robust  as  long  as  q  is 
nonincreasing  and  differentiable. 

To  derive  an  optimum  invariant  test,  we  note  that  the 
testing  problem  Hq :  \  =  o2I  versus  Hi:  \  /  o2I,  o2  >  0, 
unknown . under  the  model  (2.1)-(2.2)  remains  invariant 
under  the  group  G  of  transformations  G  =  R+  x  rPI  x  0  (p^)  x 


l\  2  l  (1  +  F*)~np/2  v(dH) 

e  (p) 


(3.8) 


where 


F*  = 


tr(£_1  -  I  )HSH‘ 


tr  S 


and  H  e  0(p) 


However,  in  our  problem,  because  of  presence  of  z 
and  the  structure  of  the  joint  density  in  (2.4),  H  has 
to  be  taken  as  a  block  orthogonal  matrix  given  in  (3.7) 
above . 


Remark  3.1:  It  is  interesting  to  observe  that  the  ratio 


in  (3.5)  is  independent  of  q.  This  implies  that  any 


null  robust  invariant  test  is  automatically  nonnull  robust. 
Also,  the  optimality  robustness  of  an  invariant  test  follows 
trivially. 

The  crux  of  the  problem  now  is  to  expand  the  R.H.S.  of 
(3.5)  in  \  locally  around  the  null  hypothesis  HQ.  Because 
of  the  invariance  of  the  problem,  we  assume  without  loss  of 
generality  that  \  is  of  the  form 


$ 

.<1 


l=Ai  \l2  ,  K1  =  diag(X^)  ,  A2  =  diag(6i)  (3 

[ijl  A2  J 

and  that  the  null  hypothesis  is  specified  by  Hqj  £  =  Ip. 
Local  alternatives  are  fixed  by  choosing  e  >  0  small  and  a 
suitable  matrix  A  and  setting  £  =  Ip  +  eA.  Writing 


pll 

A12I 

}_a21 

*22j 

All 

and  A22 

^2 

+  £ A22 1 

A”1 

—  T  _ 

A1 

'  Pi 

A~1 

—  T 

A2 

"  P2 

2.2  ,2 
L11  +  e  A11  +  °*e 


2  2 

k22  +  £  A22 


<A1  '  *  <  V  £lH  '  £"412A^A21> 


2.  .-1.  .-1 


=  Ip.  -  c4ll  *  e2(A11  -  *12*21 >  *  0(e2> 


A2  ^ 2 1  ^ A1  "  £l2A2  ^ 21  ^  X 1 2* 2  ~  e  A21A12  +  °^£  ^ 


A2  ^21*A1~  ^12A2  ^ 21 ^  *  eA21~  £  ^A22A21+  A21All*+°*e  * 


Using  (2.6)  and  (3.10),  the  expression  for  F  in 


(3.6)  is  simplified  as 

F  =  (trS+trV)_1{tr(^_1-Ip)HSH>  +  fc r ( I22  “  Ip  ^H2VH2^  (3, 

=  (trS  +  trV) -1  {-e  [  tr  4nH1SnHj  +  tr  A22H2(S22+  V)H2 
*  2  tr  412h2S2]H;1  +  r2[tr(4^  -  t 

+  “  422H2<S22  *  V'H2  +  tr  421412H2S22H2  * 

+  2  tr ( +  ^21^11^1^12^2^  +  Oy(£  ) 

where  the  last  term  Oy(e^)  in  (3.11)  is  uniformly  o(e  )  in  y 
We  now  use  the  following  facts  (Kariya  (1985)): 

(a)  J  tr(AQBQ')  v(dQ)  =  tr-A^tr  B 

0(p)  p 

(b)  /  (tr  AQBQ' )  2  v(dQ)  =  B2) 

r\t  \  P  \  P  r  ±  ) 


(tr  AQBQ " ) J ( 1  +  0  trAQBQ') 


v(dQ)=o( trA*) • ( trBJ) 


(c)  J 


0(P) 


(d)  J'  tr(AQ)  v(dQ)  =  0 
0(P) 


for  0  <  9  <  1 ,  y>0  and  A  close  to  the  null  matrix  in  (c).  Tb 
terms  o(tr  A2)  in  (c)  above  is  uniform  in  the  elements  of  B. 


We  are  now  in  a  position  to  evaluate  the  R.H.S.  of 


(3.6).  Expanding  (l+F)-np/2  as 


(1+F)  np/2=  1-yF+Y(Y+1)F2/2-y(Y+1) (Y+2)F3(l+0F)~Y”3/6  (3.12) 


where  0  <  6  <  1  and  y  =  np/2,  we  compute,  using  (3.11)  and 


the  above  facts. 


/  F  v ( dH. )  v(dH.)  = 

CXp^  x  0(p2) 


(3.13) 


(tr  Axl)(tr  Sn)/Pl  +  (tr  A22 )  ( t  r  ( S22  +  V)/p2) 


tr  S  +  tr  V 


2  .  ,  . i  trSll 


{trUf.-A.-A-.)}  — ii+(trA"  )(tr— )  + 


i.muu  12  21' 1  p,  1  22  p 

+  e  r - = - — 

L  tr  S  +  tr  V 


2  (S  2^V)  (trA21A12)(trS22) 
22)(tr  p2  )  +  p2 


-.V'.'lv v:\-  v  \  .  .  V 


A  -*. 


♦  o(e  ) 


/  F2v(dH1)v(dH2) 
CKpj  x0(p2) 


-2  2 ,3(triii)trSn 

(“st”v>  £  <— l(p-— U  * 


3(tr  l\2)  tr(S22  +  V)2 


P2(P2  +  1) 


4  /  (tr  A, _H-S7, ) 2  v(dH  )  v(dH,)  + 

0( Px  )  x  0(P2) 


tr(S22  +  V)  ,  2 
+  2  ( tr  An)(tr  S11)(tr  A22)  - -  }  +  0(0 


It  therefore  follows  that  for  quite  general  local 
alternatives  of  the  type  £  =  Ip  +  eA  considered  above,  the 


ratio  — =—  is  expressed  as 

dPH 

H0 


I  -in- 11  -1 

r^lll  2  \U7\  2 [ i+y£ 


trS, ,  tr ( S27+V) 

<trAll^-?T^)  +  <trA22)— T2 - 

tr  S  +  tr  V 


+  o(e) ] 


v  v  ■  A*.v .v V  v.v*.  ."  v.v.v  \v>  v.%/VV. 


a 


This  means  that  the  LBI  test  for  a  specific  A  can  be  obtained 


This  test  rejects  Hn  when 


( trS, ,  )  tr(S.,4-V) 

ttrAll)-^-^trA22)— pf— 

tr  S  +  tr  V 


large.  However,  because  of  its  dependence  on  A,  this  result 
is  not  very  useful. 

On  the  other  hand,  if  we  consider  a  subclass  of 

_  tr  An  tr  A22 

alternatives  of  the  form  1  =  Ip  +  cA  with  — — —  =  — — — 

then  the  coefficient  of  e  in  R.H.S.  of  (3.15)  becomes 

a  constant  and  it  becomes  necessary  to  look  into  the 

coefficient  of  e^.  This  is  readily  available  from  the 

previous  calculations  and  yields  the  following  expression  of 


the  ratio 


1  =  ! £ ! ~ ( n~l ) / 2  |£  I'1/ 2  [l  +  Ye 


tr  A. 


(3.16 


2  r ,tr(All_  A12A21(tr  Sll) 


(tr  A‘2)tr(S22  +  V) 


tr  S. 


_!  3(tr  AX1) (tr  Su) 


+  tr(A2iAi2)  +  trv)  +  <— pl(P;-— 
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3(trA22)(tr(S22+V)2) 

p2(p2  +  1] 


+  2(trAxl) (trS11) (trA22) (tr(S22+V)  )/P1P2 


+  4  J  (tr  A12H2S21H')2v(dH1)v(dH2) ) (trS+trV)-2}  +  o(e2) 

0(P1)jc0(p2) 


The  locally  best  invariant  test  statistic  against  such 
specific  local  alternatives  thus  turns  out  to  be  the  co¬ 
efficient  of  in  the  R.H.S.  of  (3.16).  Unfortunately 

this  again  depends  heavily  on  the  fixed  A.  In  the  case  when 

o 

Al2  =  0,  the  coefficient  U(say)  of  e  simplifies  to 


(trA2  )(trSn)  (trA2  )(tr(S„+V)  )  , 

U  ■  { - - - —  +  - - — -  }  ( trS  +  trV) 


Pi 


P2 


(3.17 


3 ( tr  A2x)(tr  S2^  3(tr  A22)(tr(S22  +  V)2) 


P^Pi  +  1) 


P2(P2  +  x) 


+  2(trA11) (trA22) (trSxl) {tr(S22+  V) )/P]p2} { trS  +  trV) 


-2 


This  still  depends  on  A^  and  A22.  If  we  restrict  An  and 
A22  to  satisfy 


tr  A 


11 


tr  A 


22 


tr 


Pi 


tr 


P2 


(3.18 


and  tr  A^  =  Kx  tr  A^2  =  K2(tr  An)(trA22) 

for  ,  K2  >  0  known,  then  LBI  test  statistic  turns  out  to 


3 ( trSf , ) 


lP1<P1+1> 


tr ( ( S--+V)  )  2 ( trS, , ) tr ( S--+V) 


K1  p2(p2  +  1)  P1  p2  K2 


( trS  +  trV) 


The  preceding  analysis  can  be  summarized  as  follows. 


o  n 

Theorem  3.1:  For  testing  H„ :  7  *  a  I  vs  H, :  T  =  a  1 
-  0  u  *•  p  1  L  ] 


+  e  0  ,  €  > 

0  a22 


0  small,  in  the  model  (2.1)-(2.2), 


the  test  which  rejects  Hq  for  large  V  is  LBI  provided  An 
and  A22  satisfy  (3.18). 


Remark  3.2:  The  testing  problem  mentioned  in  Theorem  3.1 
can  be  regarded  as  testing  sphericity  against  independence 


Remark  3.3:  It  is  interesting  to  observe  that  while  the 
absence  of  V  in  (3.6)  makes  the  corresponding  analysis 
smooth  and  leads  to  an  LBI  test  against  very  general 
local  alternatives,  its  presence  changes  the  problem 
such  drastically.  The  test  statistic  V  is  not  all 


that  desirable  because  it  fails  to  use  the  covariance 


component  Si2* 


Remark  3.4:  A  reasonable  test  for  this  problem  would  be 

-  o 

tr  T 

to  reject  HA  for  large  values  of  W  =  - P  where 

0  (tr  TT 


Sii 

s12 

S21 

S22+V 

This  is  a  generalization  of  the 


locally  optimum  test  statistic  - =■  when  V  =  0  to  the  case 

(tr  S)2 

when  V  prevails.  It  is  possible  that  for  some  specific  A 
with  A]_2  t  w  may  turn  out  to  be  the  LBI  test  statistic. 


Remark  3.5:  It  is  not  difficult  to  evaluate  the  integral 

/  (tr  Ai2H2S21Hl'  v  ( dH,  )  v  ( c3H2  )  which  appears  in 

OtPj.)  x  0(p2) 

(3.14)  and  (3.16).  Following  as  in  Kariya  (1978),  and  Eaton 
and  Kariya  (1983),  it  can  be  shown  that 


f  (tr  ^12h2S21H1 5  2  v(dH1)v(dH2)=c(trA12A21) (trS21S12) 

x  0(P2) 


where 


(tr  H3Hp2v(dH1)v(dH3)  . 
0(p1)  x  0(p1) 


.  vW 
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